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Abstract: For the regression model y — X(3 + e where the errors follow 

the elliptically contoured distribution (ECD), we consider the least squares 

(LS), restricted LS (RLS), preliminary test (PT), Stein- type shrinkage (S) 

r — , and positive-rule shrinkage (PRS) estimators for the regression parameters, 

(N; /3- 

^^ , We compare the quadratic risks of the estimators to determine the relative 

\l^ ' dominance properties of the five estimators. 

cn 
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^ \ 1 Introduction 



c^ 



The most important model belonging to the class of general linear hypotheses is the mul- 
tiple regression model. The general purpose of multiple regression is to learn more about 
the relationship between several independent or predictor variables and a dependent or 
criterion variable. 

Consider the multiple regression model 

y = X/3 + e (1.1) 

where y is an n-vector of responses, X is an n x p non-stochastic design matrix with full 
column rank p, /3 = (/3i , • • • , Pp)' is p- vector of regression coefficients and e = (ei , • • • , e„)' 
is the n-vector of random errors distributed according to the law belonging to the class of 
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elliptically contoured distributions (ECDs), £^„(0, a^V, ip) for a G K+ and un-structured 
known matrix V S 5'(?t.), where S{n) denotes the set of all positive definite matrices of 
order (n x n) with the following characteristic function 

(j)e{t) = ^J {aH'Vt) (1.2) 

for some functions ip : [0,oo) — > R say characteristic generator (Fang et al., 1990). 
If e has a density, then it is of the form 

/(e) (X \a'V\-igl^-^e'V-'e^ (1.3) 

where g{.) is a non-negative function over R+ such that /(.) is a density function w.r.t 
(with respect to) a cr-finite measure fi on M^. In this case, notation e ^ £„(0,ct^V,5) 
would probably be used. 

It is sometimes difficult to have complete analysis of the regression model with BCD 
errors of the type (jl.2p or (|1.3p . To overcome such difficulties, one may consider any of 
the three sub-classes of ECDs, namely, 

(i) Scale mixture of normal distributions, 

(ii) Laplace class of mixture of normal distributions, and 

(iii) Signed measure mixture of normal distributions. 

General formula for the above mixture of distributions is given by 



U{x) = / W(i)0AA„(o,t-ia2v)(a^)di, (1.4) 

Jo 

where 4'Af„{o,t-^cr^v){-) is the pdf (probability density function) of ^1(0,^^^17^^). 
(a) If 

W(T)=2(r(7/2))"M^) T-(T+i)e-i?^, 0<7,a2,r<oo (1.5) 



2 
then we have 



m = 7i:hj^^^i^i^^^{^ + ^-^zr'] ^ (1-6) 



(7r7)"/2r (7/2) cr" V 1'^^ 



where E{e) = and E{ee') = ^V = al for 7 > 2. 
(b) Chu (1973) considered 

w(t) = (2^)t|aV|h-fi/:-M/(s)], (1.7) 

£^^[/(s)] denotes the inverse Laplace transform of /(s) with s = [x' {a'^V)^^ x / 2]. 
For some examples of /(.) and yV(.) see Arashi et al. (2010). 

The inverse Laplace transform of /(.) exists provided that the following conditions 
are satisfied. 



(i) f{t) is differentiable when t is sufficiently large. 
(ii) f{t) = o(i-'") as i ^ oo, TO > 1. 

Although, it is rather difficult to derive the inverse Laplace transform of some 
functions, we are able to handle it for many density generators of elliptical densities. 
We refer the readers to Debnath and Batta (2007) for more specific details. 

The mean of e is the zero- vector and the covariance-matrix of e is 

/•OO 

S<, = Cov(e) = / Cov{€\t)yV{t)dt 
Jo 

yV(t)Cou {A/'p(0,i- W)} dt 
/ t-^W{t)dt] a^V, 



(1.8) 

,"'0 / 

provided the above integral exists. 

Comparing the models (|1.3I) and (|1.4p . since S^ = Cov{e) = —2^'{0)a'^V, using 

(|1.6p we can conclude that 

-2i/>'(0) = / t-^W{t)dt. 
Jo 

Now suppose that X ^ £n{lJ-, ^iff)- Then it is important to point out that since 
/ f{x)dx — 1, using Fubini's theorem we have 



i = I I W{t)(t>u^^^^t-^v){x)dtdx 
WW / 4>M„{p.,t-^v){.x)dxdt 

i 

W{t)dt. 



X Jo 

OO 





OO 



Thus for nonnegative function W(.), it is a density. For nonnegative function W(.), 
the elliptical models can be interpreted as a scale mixture of normal distributions. 

(c) Srivastava and Bilodeau (1989) considered the signed measure, W(i) such that 



OO 



(i) / t-^W+{dt) < 







(m) / t"^W'{dt)<(x, (1.9) 



where yV+ — W is the Jordan decomposition of W in positive and negative parts. 
Note that from (i) - {ii) of pT^ . 

r'^W{dt) < OO (1.10) 

and thus, Cov{e) exists under the sub-class defined above. 
This subclass contains the subclass defined by (b). 



Remark 1.1. Regarding the above classifications, we should take the following notes: 
1. In all the above classes we have 



S, = -2^'(0)(tV = (j 



oo 

t-^W{t)dt ) a^V 



resulting in -2V''(0) == j^ t-^W{t)dt. 

2. The subclass (a) is neither contained in subclass (b) nor in the subclass (c). How- 
ever, subclass (b) in contained in the subclass(c). Thus, all the implications about 
the subclass (c) can be used for the subclass (b). 

3. For the subclass (c) we can assure that — 2'0'(O) ~ L t^^yV(t)dt exists. However 
it may not exist for the subclass (h). 

Some of the well-known members of the class of ECDs are the multivariate normal, 
Kotz Type, Pearson Type II & VII, multivariate Student's t, multivariate Cauchy, Logis- 
tic, Bessel and generalized slash distributions. Dating back to Kelker (1970), there are 
many known results concerning ECDs, in particular the mathematical properties and its 
application to statistical inference. These results have been put forward by Cambanis 
et al. (1981), Muirhead (1982), Fang et al. (1990) and Gupta and Varga (1993) among 
others . 

The object of the paper is the estimation of the regression parameters, (3 — (/3i, • • • , j3p)' 
when it is suspected that /3 may belong to the sub-space defined by H(3 = h where H 
is a q X p matrix of constants and /i is a q-vector of known constants with focus on 
the Stein-type estimators of f3 in addition to preliminary test estimator (PTE) based on 
the error distributions belonging to the subclass(c) which includes subclass(b) described 
earlier. 

The recent book of Saleh (2006) dealing with the proposed estimators (chapter 7) 
presents an overview on the topic under normal as well as nonparametric theory covering 
many standard statistical models. Tabatabaey (1995) and Arashi and Tabatabaey (2008) 
covers the theory with spherically symmetric distribution of errors developing many 
interesting calculations. For some systematic work have been done so far in the context 
of Stein-type estimations see Srivastava and Bilodeau (1989) and Arashi and Tabatabaey 
(2009) under elliptical symmetry. 

We organize our paper as follows: Section 2, contains the estimation and the test 
of hypothesis along with proposed estimators of /3, section 3 deals with the bias, risk 
and MSE expressions of the proposed estimators while the analysis of the risks and 
comparisons are presented in section 4. Concluding remarks are presented in section 5. 

2 Estimation and Test of Hypothesis 

In this section, we present the estimate of /3 and ct^ under least square (LS) theory. 
Further we discuss the problem of testing the general linear hypothesis, Hf3 = h. The 
test of this hypothesis covers many special cases considered in practical situations. 
Using standard conditions, it is well-known that the generalized LS (GLS) estimator of 
f3 is 

^ = {X'V''^X)-^X'V'^y = C'^X'y, C = X'V^X. (2.1) 



Under elliptical assumptions, its distribution is £p{(3,a'^C ^,g). Similarly the estimate 
of the (T^ is 

a^^-iy-X~^yV-\y-X~^). (2.2) 



It is easy to show that 



^2_ (y-X~fiyV-\y-X'^) 



gz ^ i£ ci^j V2 :^ (2.3) 

n — p 

is an unbiased estimator of a^ ~ -~2ip'{0)a^. 

For test of HP = h (where q < p), we first consider the restricted estimator given by 

P = p-CH'Vi{Hp-h), Vi = {HCH')^. (2.4) 

It can be directly verified that f3 ^ £p {(3 - CH'Vi{H(3 - h), a^Vz, g) for V2 = C{Ip- 
H'ViHC). Therefore, we get 

E0 -13) = CH'Vi (H(3 - h) 

= -^ (say), 

E0 - (3)' - (3) = -2<j^-^'{Q)tr{V2)+8'S 

= <Tltr{V2)+8'6. (2.5) 

Similarly, under H(3 ~ h, the following estimator is unbiased for a^. 

s*^ ^ iy-m'v-\y-xf3)^ ^^_^^ 

n — p + q 

from least square's theory. 

Now we consider the linear hypothesis H(3 = h and obtain the likelihood ratio test 
(LRT) statistic for the null hypothesis Hq : H(3 = h as well as its non null distribution. 
The LRT statistic is a consequence of the results of Anderson et al. (1986). 

Theorem 2.1. Let 

w == {(3:(3eW,H(3 = h,<7>0,VeS{n)},and 
n = {l3:l3eRP,<T>0,VeS{n)} 

Moreover, suppose y^ g{y) has a finite positive maximum yg. Then under the assumptions 
of model (1.1) the LRT for testing the null-hypothesis Hq : H(3 = h is given by 

_ {Hf3-hyV,{Hp-h) 

La- ^^2 , yi.i) 

and has the following generalized non-central F- distribution with pdf 

, q^\(q+2r) /, |(<?+2r-2) (0) . , 2N 

n* (r \-\^—SjilL ZH "^ ^ *' (0»\ 

'^""^ "^ " 4^0 H B (^, f ) (1 + -£„) ^'«— ' ' ^ ^^ 

where m = n — p, Aj = ^/^'e for 6 = {H jS — hyv i{H (3 — h) , and the mixing distribution 
becomes 



For the proof see the Appendix. 
Corollary 2.1. Under Hq, the pdf of C„ is given by 

1 9 1 

/ 9 \ 2 f2-^ 



9;™(^") = „.„ „..'?' :,.i(,+^) - (2-10) 



which is the central F- distribution with {q, m) degrees of freedom. For W(t) given by 
(jl.Sp . we get the results produced by Tabatabaey (1995). 



Now, consider the calculations of the probability of that £„ < Fa, which gives the 
power function of the test as 



1 / „ N ™ 

2('Z + 2r),- 



where x = , '^ ° ^ and Ix{a,, b) is the incomplete Beta- function, 



(2.11) 



^-(«'^) = ^r^/ u'''\l-u)'-'du. (2.12) 

The function ()2.1ip stands for the power function at a-level of significance and may be 
called the generalized non-central F-distribution cdf (cumulative distribution function) 
of the statistic £„. 

Similarly, the cdf of a generalized non-central chi-square distribution with 7 d.f. may 
be written as 

n;ix; S') = Y, ^K^°H^l)n^+2rix; 0), (2.13) 

r>0 

where 'H^j^2r{x] 0) is the cdf of Chi-square distribution with 7 -H 2r d.f. 

Following Judge and Bock (1978) we may write the following lemma where Ej\f stands 
for the expectation w.r.t normal theory. About new insights. Lemma 2.1 in the below, 
gives possible extension for evaluating specific moments under elliptical assumption. As 
it can be seen from the result of Section 3, it is always needed to obtain the expectation 
of a measurable function for risk functions. To ease the understanding of what Lemma 
2.1 proposes, consider a random elliptical variable X ~ £q{fi,V,g), then for a Borel 
measurable function /i, we have 



E{h{X)) = / Eu{h{X))W{t)dt, 



where Ej\f{h{X)) shows taking the expectation of h under the model Afq{fi, V). 

For the proof of relevant expectations in Lemma 2.1 under normality assumption see 
Judge and Bock (1978) and Salch (2006). 

Lemma 2.1. If w ^ £q{ri,a'^Iq,g), and ^ is a measurable function then 



f-OO 

E[w<P{w'w)] = / Ej^[w<P{w'w)\t] W{t) dt 
Jo 

f-OO 

= V Ej^[^{xl+2{^'t))\t]W{t)dt, (2.14) 

(ii) 

POO 

E[{ww'Mw'w)] = I J E^f[^{xl+2{^'t))\t]W{t)dt 

Jo 

+ v'v / E^i'^ (x'+4(A?)) \t] W{t) dt, (2.15) 
Jo 

(Hi) 

E[{w'A wMw'w)] = tr{A) / E^[^ (x?+2(A?)) |i] W{t) dt 

Jo 

f-OO 

+ (r,A',7) / E^[^ (x^+2(A?)) |i] W{t) dt, (2.16) 
Jo 

where A^ = tri'ri, A e 5(g). 
Further, 

(i) 

E('-'^^[xl+AAl)] = J2 ^Ki'^H^Diq + S-2 + 2r)-\ (2.17) 

(ii) 

E'^^-'^^xl+si^l)] = E liKi^\^l){q + S-2 + 2r)-\q + S-A + 2r)-\ (2.18) 

and for h — Q,l 



r\ 

r>0 



r! 

r>0 



Ki^H^l) = [-2^'{0)r[^) I ^-^, e ''^ '' Wit)dt, (2.19) 

// ^{w'w) = I{w'w < c), where I{A) is the indicator function of the set A, then 

E[w I{w'w < c)] - r, / Em[I (x^+2(A^) < c) \t] W{t) dt 

Jo 

= rjn;+2{c,Al), (2.20) 



(ii) 



(in) 



Further 



E[ww' I{w'w<c)] = Ign*2{c,^l)+riv'n:+^{c,Al), (2.21) 



E[-w'A w I{w'w < c)] = 



tr{A)n;+2ic,Al) 



(2.22) 



(ii) 



(Hi) 



E 



^ f w'w 



= rjc: 



(2) 

g+2,7n 



E 






E 



I A T (w'w 
w Aw 1 \ —-— < c 



q + 2 



c;At] , m = n-p, (2.23) 



.^(2) / g „.a2 



'/'/ "-^g+4,m 



(2) 



g + 2 



-c;A 



= triA)G'^l, 



q + 2 



:c;Al 



(2) / g 

+4,m I ^ ^ 2 



c;A^ 



(2.24) 



(2.25) 



Further, 
(i) 



(ii) 



, qFa 

n - p + gFa 
^ ■ '^ g + s 



g + 2i n — p 

:^ hr, — - — 



n — p + qd 



2J 1 



n-pj B(5±^,^) 



(2.26) 

(2.27) 



(2.28) 
q + s + 2r — 2j n — p + 2j 



In many practical situations, along with the model one may suspect that f3 belongs to 
the sub-space defined by H/3 ~ h. In such situation one combines the estimate of f3 
and the test-statistic to obtain 3 or more estimators as in Saleh (2006), in addition to 
the unrestricted and the restricted estimators of /3. Now, we proceed to define three 
more estimators of /3 combining the unrestricted, restricted and the test-statistic £„ as 
in Saleh (2006). First we consider the preliminary test estimator (PTE) of /3 which is a 
convex combination of f3 and /3: 

- PT 

(3 = f3IiCn > F^) + (3IiCn < F^), (2.29) 

where I{A) is the indicator function of the set A and Fa is the upper a*'* percentile of 
the central F-distribution with {q,m) d.f. 

The PTE has the disadvantage that it depends on a (0 < a < 1), the level of 
significance and also it yields the extreme results, namely and depending on the 
outcome of the test. Therefore, we define Stein-type shrinkage estimator (SE) of /3, as 

0' = + {1 - dC-')C0 - 0) 

= 0-dC-\0-0), (2.30) 

where 

d- ^7'K,t and ,>3. (2.31) 

q(n — p + 2) 

The SE has the disadvantage that it has strange behavior for small values of £„. 
Also, the shrinkage factor (1 — d£~^) becomes negative for £„ < d. Hence we define a 
better estimator by positive-rule shrinkage estimator (PRSE) of /? as 

/S'"^ = + {I ~ dC-^)I[Cn > d]i0 - 0) 

= 0'-{l-dC-')I[Cn<d\{0-0). (2.32) 

Note that this estimator is also a convex combination of (3 and (3. 

The biases, the quadratic risks and MSE-matrices of the estimators are given in the 
following section and the dominance properties are studied in section 4. 

3 Bias and Quadratic Risk of the Estimators 

Consider for a given non-singular matrix W G S{p), the weighted quadratic error loss 
function of the form 

Lif3*;f3) = {0*-0yWi0*-0), (3.1) 

where f3* is any estimator of f3. Then the weighted quadratic risk function associated 
with (|3.ip is defined as 

R{0*;0)=E[{f3*-0yW{0*-0)]. (3.2) 

In this section, we determine the biases, and using the risk function (|3.2I) . evaluate 
the quadratic risks and MSE matrices of the five different estimators under study. First 
we consider bias expressions of the estimators. 



Directly 
Also 
Using Lemma 2.1 we have 



61 = E[^ - /3] = 0. 
b2 = E$ - /3] = S. 



63 = E0''^-f3) 

= E[p-I{Cn<F^)C^-(3)-f3] 

= E[p-f3]-E[I{Cn<F^)Cp-^)] 

= -CH'vY^ E[I{Cn < Fc,)Vy^{H^ - h)] 



Applying Lemma 2.1 we obtain 



, cS 



64 - Ei(3 -(3) 

= E[~^-dC-'CP-^)-/3] 

= -dC-^H'Vy^E[C-^vy^{H^ - h)] 

= -dqSE'^^^lxl+liAl)], 

Finally, by making use of Lemma 2.1 once more, we get 

bs = E0''^-/3) 

= E0'' -13)- E[I{Cn < d)Cp - (3)] + dE[C-H{Cn < d)0 - ^)] 



-dq6E'^'\x*i+2iAl)] + 5G\%^^ {d;A 



qd 

"qT2' 



■6E^^^ 



F-+\^{Al)I {Fg+2MAl) < 



q + 2' 



(3.3) 

(3.4) 



(3.5) 



(3.6) 



(3.7) 



Note that as the non-centrality parameter A^ -^ 00, bi — b^ = 64 = b^ — while 62 
becomes unbounded. However, under Hq : H/S — h, because 6 — 0, bi — b2 — b^ = 
bi = 65 = 0. 



For the risks of the estimators, considering quadratic risk function given by (3.2), we 



get 



Also 



Ji(/3;/3) 



R(P-I3) = E[Cfi-(3)'W(p-l3)] 



E[{f3-f3)'W{f3-f3)] 

altr{V2W)+5'W5 

R{kf3) - altr[WCH'ViHC] + d'WS. 



(3.8) 



(3.9) 
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Note that R = C^ H'ViHCi is a symmetric idempotent matrix of rank q < p. 
Thus, there exist an orthogonal matrix Q (see Searle, 1982) such that 



QRQ' = 







, QCi ^^^WC^^^^Q' = 



All All 
A21 A22 



The matrices An and A22 are of order q and p — q respectively. 
Now we define random variable 

w = QC\''^~p-QC~^''^H'Vih, 



then, w ^ £p{ri,a'^Ip,g), where 



V 



,1/2 



QC{'"l3 QC^^'^'H'Vih. 



-l/2rr/i 



(3.10) 



(3.11) 



(3.12) 



Partitioning the vectors w = {w[,w'2y and 77 = {'ri'i,r]2y where Wi and W2 are sub- 
vectors of order q and p ~ q respectively, we obtain 



^-/3 = C-'/'Q'iw-q) 



Thus, we can rewrite 



Also we get 



/:„ = ^^, 7i'ir^i^e^{H(5-h)'Vi{Hf3-h). 



tr{W[CH'ViHC]} = tr{QC~^/^WC^^^^Q'QRQ'} 

All A12 



(3.13) 
(3.14) 



= tr 



A21 A22 
tr{Aii). 



Iq 





And 



6'W6 = {Hf3-hyViHCWCH'Vi{Hf3-h) 
= »7'i-4iiJ7i. 

Substituting (iXT5|) - (l5TB in (3.9) yields 

R0;(3) = RC^;(3)-altr{Aii)+r][Aiirj^. 

Similarly, we obtain 

R0''^;P) = E[0''^ - (3yW0''^ - (3)] 



(3.15) 

(3.16) 
(3.17) 



= E{[C^ -f3)- I{C„ < F^)C^ - ^)]'VF[(/3 - /3) 

-IiCn<F^)C^-m 
= E[C^ - f3ywC^ - /3)] - 2E[IiC„ < F^)(p - /3)'W^(^ - ^9)] 

+E[i{Cn < F^)C^ - ^ywC^ - ^)] 

= R{hP) - E[w'^AiiWiI{Cn < Fa)] 

-2E[w'2A2iWiI{Cn < F„)] + 2r][AiiE[wiI{Cn < F„)] 
+2T]'2A2iE[wiI{£n<Fa)]. 
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By making use of the representation given by (1.4), we conclude that Wi \ t and W2 \ t 
arc conditionally independent under normal theory, then one may write 

Elw'^A^iWiIiCn < F^)] = / W{t)EN[w'2A2iWiI{Cn < F„)]dt 

Jo 



Therefore, we get 



- PT 



77^^21 / Wit)EN[wiI{Cn < F^)]dt. 

Jo 



R{(3 ;/3) - i?(/3;/3)-aX^n)G^'4„(F„;A2) 

Now consider for every q > 3, one can obtain 

E('nx;ll{Al)]-{q-2)E('\x;lli^l)] - AlE('\xf^:iAl)] 
Then, one can find 



i?0^;/3) 



= E[0'' - fByWi^' - (3)] 

= E[0- f3ywC^ - /3)] - 2dE[C-\~^ -f3YwCP- ^)] 

+d^E[c-^C^-^yw0-m 

= i?(/9;/3) - 2dE[C:^'^{w\AiiWi - q^AnWi + W2A21W1 
-T]'2A2iWi)] + d^E[£-^{w[AnWi)] 

= Rik,f3)-dqcTltriAn)^iq-2)E^'^[x;+2{Al)] 

{q + 2)r][Anqj_' 



^ 2a2A2tr(An) 
Finally, the risk of PRSE is given by 



(2Al)E('nx;l,{Al)] 






(3.18) 

(3.19) 
(3.20) 



(3.21) 



= R{f3 ; /3) + E[{1 - dC-^fl{C^ < d){(3 - (3)'W{(i - /3)] 



-2i?[(l - dC-^)I{Cr, < d)0 - (3YwCfi - p)]. 



(3.22) 



But using the fact that 



E$ -f3yw(l-pC-')I{Cn<d)0-h] 
= E[0 -(3) + {l- dC-')0 - ^)]'Wil - dC-^)I{Cn < d)0 - fi)] 

= E{0- f3yw[ii - dc-^)i{Cn < d)0 - m 
+E[ii - dc-')^i{Cn <d)0- ^yw0 - ^)], 
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we get 



R0'^;f3) 



~^s 



R{fi ; /3) - E[{1 - dC-^fl{Cn <d){fi- fi)'W{fi - ^)] 

-2E{Cfi - (i)'W[{l - dC-^)I{C^ <d){fi- m 

i?(;3''; /3) - E[{1 - dC-^fl{Cn < d)w[AnWi] 

-2E[{1 - dC;^^)I{Cn < d){w[AiiWi - T][AiiWi + W2A21W1 

-jy'aAai'Wi)] 



= Rip';/3) 



-al[tr{A,,)E^'^ 



(1 



qd 



qd 



(1- 



qd 



qd 



q + 2 
(1 - -^F-,\^iAl))I{F,+2,rn{^l) < 



q + 2' 



(3.23) 



4 Comparison 

Providing risk analysis of the underlying estimators with the weight matrix W, from 
Theorem A. 2. 4. of Anderson (2003), we have 



or equivalently 



echi{Aii) < J7iAiirji < OchqiAii) 



crlAlchiiAn) < v'lAnT], < crlAlchg{An), 



where chi{Aii) and chq{Aii) arc the minimum and maximum eigenvalue of An respec- 
tively and Al = O/al- 

Then by (3.8) and (3.17) one may easily see that 



i?(/3;/3) > Ri(3;f3)-aitriAii)+aiAichiiAii), 
R{^;f3) < R0;/3)-altr{An)+>ylAlchgiAn). 



(4.1) 



When Al is equal to zero, we have the above in equalities. Thus the restricted estimator 
(/3) dominates the generalized least square estimator (/3) denoting by /3 ^ /3 whenever 



A^ < 



while /3 ^ /3 whenever 



At > 



trjAn) 
chq{Aii)' 



tr(An) 
chi{An)' 



(4.2) 



(4.3) 
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For W = C because tr{Aii) = qwe conclude that y9 performs better than 0^0) in 
the interval [0,q<7^) and worse outside this interval. 



-PT 



Comparing (3 versus /3, using risk difference, we have 



■ PT 



R{(3-(3)-R{(3 ;/3) = aXAn)G^'^^2,„ (F„; A^) 



(2) 



-il[A,,r^,[^G\%,„, {F^;Al) - G\i^^^ (F„; Aj)] 



It follows that the right hand side of (j4.4p is nonnegative whenever 



A^ < 



G 



(1) 

g+2,m 



[Fo^-Al) 



tr(An) ^ 



(4.4) 



(4.5) 



- PT 

and vice versa. Also Under Hq : H(3 = h, (3 h /3. 

-PT 

Now we compare (3 and (3 by the risk difference as follows 



-PT 



Rif3;(3)-R{f3 ;/3) = -aXAn)[l - G^'^ ,„ (F„; A^)] 



ViAn»7i[l-2G^^4,„(i^„;A2) 



+^9+4,m (-?"«; ^*)]- 



PT 



Thus (3 y (3 whenever 

ir(An) 



At < 



1-G 



(1) 

q-\-2,m 



{F.;Al) 



ch,{An) 1 _ 2G(%.„ {F^;Al) + Gfl,„, (F„; A^) 



(4.6) 



(4.7) 



-PT 



and vice versa. However, under Hq, the dominance order of /3, /3 and j3 is as follows 

(4.8) 



-PT 

f3hf3 hf3. 



In order to determine the superiority of /3 to /3, it is enough to see that the following 
risk difference 



R{kf3)~Rif3 ;/3) 



is positive for all A such that 



aldqtriA,^)^iq-2)E'^'^[xllliAl) 
{q + 2)rj[Aiir]^ 



1- 



2a2A2tr(An) 



tr{An) ^ g + 2 



c/i,(Aii) 



(2A^)i?(2)[x;;4(A^)], (4.9) 



(4.10) 



which asserts J3 uniformly dominates (3. 

Further, we show that the shrinkage factor d of the Stein-type estimator is robust 
with respect to (3 and the unknown mixing distribution. 
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Theorem 4.1. Consider the model (jl.ll) where the error-vector belongs to the ECD, 
i5„(0, cr^V, g). Then the Stein-type shrinkage estimator, (3 of (3 given by 



=$-d*C-\~^-0) 



uniformly dominates the unrestricted estimator (3 with respect to the quadratic loss func- 
tion given by (|3.ip for W — C , and is minimax if and only ifO<d* < ^™ . The largest 
reduction of the risk is attained when d* = *"„ . 

■' 771+2 

For the proof see the Appendix. 
Remark 4.1. Consider the coefficient d given by (2.31). From q> i, we get < d = 



< 



^•s 



and thus using Theorem, 4^.1, (3 in equation (2.30) uniformly dominates 



9(771 + 2) ^ 777 + 2 

(3 on the whole parameter space under quadratic loss function. 



To compare /3 and (3 we may write 

R{k,fi) + <yXAn) - ri'iAnVi - dqaitr{An)Uq - 2)E^^\x;+2i^l)] 
{q + 2)'n[AnVi 



R0'';(3) 



1- 



2alAltriAn) 



{2Ai)E^'>[xl+,{X 



(4.11) 



Under Hq, this becomes 



while 



R0';f3) 



R{f3;f3) 



R0; (3) + alii - d)tr{An) 



R{k, (3) - altriAn) 
<R{k(3). 



(4.12) 



(4.13) 



Therefore, f3 performs better that (3 under Hq. However, as rji moves away from 0, 

~ s 
rjiAiiTji increases and the risk of (3 becomes unbounded while the risk of (3 remains 

below the risk of /3; thus /3 dominates (3 outside an interval around the origin. 
Consider under Hq 



PT 



R{(3 -,(3) = R{(3 ■f3)+aitr(Ai^)[l-. 



-PT 



>i?(/3 ;/3), 



(4.14) 



for all a such that F, 

■PT 



(7+2 777 i'^^ 0) — ^4^ • This means the estimator does not always 

dominates /3 under Hq. 

Thus, under Hq with a satisfying F7^2 ml'^iO) < ^r^, we can conclude that 



/3h/3 h(3 hf}. 



(4.15) 



15 



-8+ - S 

We compare, the risks of /3 and (3 . The risk difference is given by 



. ^s+ 



R0 :f3)-R0 ;/3) = -<jlhr{An)E 



r(i) 



(1 - ^n+V™(Aj))'/(^.+2,™(Aj) < ^: 



(1 - :^F-_,\^iAl))I{F,^,M^l) < -^) 



g + 2 3 

The right hand side of the above equality is negative since for _Fq+2,m(A^) < -^, 
(-2j-jFq+2,m(A^) — 1) > and also the expectation of a positive random variable is 

positive. That for all fB, R0'^^;l3) < R0^:l3). And using Theorem 4.1, R{J3^^][5) < 
Rik(3). 

To compare and jS , first consider tlie case under Hq i.e., 7]^ = 0. In this case 



R0 ;/3) = R0:f3) + c7ltr{Aii)\{l-d) 



qd 
q~+'2 

qd 
qT2 



(1 - -^F;+2.,nm'liF,+2,m{0) < '^'^ 



q + 2 -^^ 



q + 2- 



>R0;(3), 



(4.16) 



smce 



eW 

<i?(i) 



(1 - ^n+2,™(0))'^(^<^+2,™(0) < ^) 



(i-^n-A™(0))' 



1-d. 



(4.17) 



^ 5+ 
Thus under Hq, (3 ^ (3 . However, as rii moves away from 0, ri[Aiirii increases and 

the risk of (3 becomes unbounded while the risk of /3 remains below the risk of /3; thus 

~ s+ 

j3 dominates (3 outside an interval around the origin. 

Now, we compare (3 and (3 . When Hq holds, because G*_|_2^m(-^aj0) = 1 — a, 
R{j3 ■,/3) = R{(3 ■(3)+altr{Aii)\l-a-d 



_£;(!) 



(1 - T^^-+V™(0))^/(f,+2,™(0) < ^) 



- PT 

>i?(/3 ;/3), 



(4.18) 



for all a satisfying 



(1 - ^^,+V™(0))^/(i^«+2,™(0) < ^) 



< 1 - a - rf. 



(4.19) 



16 



- S+ , ^ PT 

Thus, /3 does not always dominates /3 when the nuU-hypothesis Hq holds. 

^ s+ ^ s ~ . . - 

Since always /3 ^ /3 ^ f3, and under Hq, the restricted estimator /3 performs 

better that all others, the dominance order of the five estimators under the null hypothesis 

Hq, can be determined under the following two categories 

^ PT ^ 5+ - 5 - 

1. (3t(3 h/3 >I3 >I3, satisfying (4.f9), 

2. ;3 h fi'^ hh' hfi'"^>~^, ya 3 F-\^{d, 0) > ^. 

5 Concluding Remarks 

In this paper, we proposed five different estimators for the regression parameters of a 
linear regression model. In this approach, the prior non-sample information Hf3 — h is 
suspected. Based on the two ordering superiority categories of the estimators, under the 

-5'+ - S 

constraint Hf3 = h, two non-linear estimators /3 and /3 perform better than unbiased 
GLSE. 

The behavior of Stein-type estimators are restricted by the condition q > 3. However, 
The PTE requires the size of testing Ho : H(3 = h. For W{t) as dirac delta function, the 
maximal savings in risk for the shrinkage estimator is "^(^2) ' '^tiil^ for yV(t) as inverse- 
gamma function, it is equal to ™^^~„j ^^^, where i' is d.f. of multivariate student's 
t-distribution. 

Remarkably, the behavior of all estimators comparing with each others under elliptical 
symmetry are exactly the same as under normal theory as exhibited in Saleh (2006). 
This phenomenon shows the dominance order of estimators and regarding substantial 
conditions under normal theory are significantly robust. 

Another noteworthy fact is that under the subclass (c) of elliptical models we could 
present fundamental Theorem 4.1 under elliptical models rather than scale mixture of 
normal distributions. It is important to point out that just under the signed measure 
yV{t) under subclass (c) we are able to prove Theorem 4.1 even for non-positive measures. 
See Srivastava and Bilodeau (1989) for more discussion in this regard. 

Finally, if ui remains constant, the distribution of F^-^,^^ tends to that of X^i/^^i as 
1^2 tends to infinity (see Johnson and Kotz, 1970). 
Now consider the class of local alternatives {K(n-)} defined by 

K(^,,y.Hl3 = h + n^^. (5.1) 

Furthermore, following Saleh (2006), consider the following regularity conditions hold 

(i) maxi<i<„ x'^{X'V~^X)~^Xi -^- as n — > 00 

where x'^ is the ith row of X; 

{ii)\im^^oo{n-HX'V-^X)} = C 

for finite C e S{p). 

Then using Theorem 7.8.3 from Saleh (2006) in addition to Theorem 2.1 we obtain the 

following important result for the test statistic 

lim P{Ln<x) ^n*Jx;5^), 
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Based on the above results, one can easily obtain the asymptotic distributional bias, risk 
and MSE matrix of each estimator under study using the following definition 

Gp{x) = lim Pk,„,{\/^5-2(/3* - /3) < x). 

Then 

b{f3*) ^ JxdGpix), M{f3*) = Jxx'dGpix), R{f3*;f3) = tr[WM{f3*)], 

which have similar notations to those are given in this paper. 



6 Appendix 

Proof of Theorem 2.1 

The likelihood ratio is given by 



A 



max^L{y) 
maxnL{y) 

dn\<7^V\^^^^maXyg 



dJd^V\-yh 



Eyg 



(y-Xf))'V-\y-Xp) 
25^ 



~ 2 \ "/2 



givg) 



2^2 



^/2 



Therefore 



-, (HI3-h)'{HC-^H')-'^{H(3-h) 
"^ (y-XpyV-Hy-Xp) 



y2ln 



1 



1 + ^/:. 

n—p '' 



which is decreasing with respect to Cn- Now consider that 
{n-p)S^\t = {y - Xfi)'V-\y - X~m 

= y'\(v-^ -V-^X{X'V-^X)-'X'V-^ ]\y\t^Xn-p 

also {HC"^H')"^/^{H~p)\t ^ Nq{{HC~^H')-^/^{H^ - h),t-^<7^Iq). Then 

where A2 = ^. Also using the fact that (y-X/3)'F"^(y-X^)|t and ^'if'(ifC"^if')"^-H'(9|i 
are independent, we get 



Cn\t 



^'H'(HC-^H')-^HJ3\t 



K 



q,n—p.Af 
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Hence 



/•OO 

Jo 



7^0 r(r+l) \m 



C, 



i+r-l 



S(£+^,^)(l + X£„)^+r 



dt 



E - 



d^'-^Kf\^l) 



B(|+r,f)(l + ^/:„)^+- 



Proof of Theorem 4.1 

By making use oi z — H'Vi{H/3 — ft,), the SE can be rewritten as 

= ^-qd*S^{z'C-^zy'^C-^z. 
Then, the risk difference of the SE and the UE under quadratic loss function, is given by 



{d*)^E\q^S^z'C-'zy^' 



= id*fEt\E_ 



^w 



q\S* {z'C-'z) 



-2d*E 

-1 



qs^z'c-^z) y^-(3yz 



-2d*Et i E 



t \ ^N 



,-1 ,^ 



qS^{z'C-^z) {(3-(3yH'Vi{H(3-h) 



g2(m + 2) 2 



m 



w-ri^.l-.^^ -2,V£, ). 



mS' 



t - t-^xl^ and ^'h'ViH^ \ t - t-^cr^xH^), where d = /S'H'ViHjS. 



Therefore, I>4 < if and only if < d* < ;^ since J^ jt^^t- dWit) > 0. (See 
Srivastava and Bilodeau, 1989). 
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